A model to describe the mechanism of conformational dynamics in proteins, based on matter interactions using the lagrangian approach and imposing a certain symmetry breaking, is proposed. Both the conformation changes of proteins and the injected nonlinear sources are represented by a bosonic lagrangian, with an additional $\phi^4$ interaction for the sources. In the model, the spring tension of the protein, representing the internal hydrogen bonds, is realized as the interaction between individual amino acids and the nonlinear sources. The folding pathway is determined by the strength of the nonlinear sources that propagate through the protein backbone. It is also shown that the model reproduces the results of some previous works.

1. Introduction

The folding pathway of a protein is determined by the sequence of its amino acid constituents. The time-ordered folding sequence leads from the primary to the secondary and subsequent structures. The secondary structure consists of the shapes adopted by each segment of a polypeptide, held together by hydrogen bonds, van der Waals forces, electrostatic interactions and hydrophobic effects. It is moreover formed around a group of amino acids considered as the ground state. It is then extended to include adjacent amino acids until the blocking amino acids are reached, and the whole protein chain along the polypeptide adopts its preferred secondary structure.

Our understanding of the mechanism described above has unfortunately not reached a satisfactory level. For instance, studies based on statistical analysis that identify the probabilities of locating amino acids in each secondary structure still achieve less than 75% accuracy. Moreover, the main mechanism responsible for a structured folding pathway has not yet been identified at all. On the other hand, protein misfolding has been identified as the main cause of several diseases, such as cancers.

Recently, Mingaleev et al. have shown that nonlinear excitations play an important role in conformational dynamics by decreasing the effective bending rigidity of a biopolymer chain, leading to a buckling instability of the chain. Following this understanding, a model explaining the transition of a protein from a metastable to its ground conformation induced by solitons has been proposed. In that model, the mediators of the protein transition are the Davydov solitons propagating through the protein backbone.

At present, the most reliable theoretical explanation for this kind of conformational dynamics of biomolecules is the so-called ab initio quantum chemistry approach. This, however, requires astronomical computational power to deal with realistic biological systems. In contrast, there are some phenomenological models describing the folding pathway as a result of the interplay between the energy transfer from a solitary solution that travels along the protein backbone and the string tension. There are also some attempts to describe the dynamics in terms of elementary biomatter using a field theory approach and open quantum systems.

This paper follows the latter approach, but starts from first principles, using the lagrangian method to derive the responsible interactions and to clarify their origins. The paper is organized as follows. First, the model and the underlying assumptions are explained in detail. This is followed by the derivation of the relevant equations of motion (EOMs). A summary and conclusions based on the numerical analysis are given at the end of the paper.

2. The models 

The model is an extension of a previously proposed toy model. Rather than considering only a self-interaction mechanism, as proposed and subsequently developed in earlier works, a more realistic model is introduced. In the model, the dynamics of the amino acids forming proteins is initially considered as a free and linear system of bosonic matter. Further, external nonlinear sources, such as a laser or light bunch, are introduced. The sources, which propagate through the protein backbone, interact with each other and with the amino acids to induce conformation changes.

The model describes the conformation changes as the dynamics of amino acids using a free and massive (relativistic) bosonic lagrangian,

$$ \mathcal{L}_c = (\partial_\mu \phi_c)^\dagger (\partial^\mu \phi_c) + \frac{1}{2} m_{\phi_c}^2 \phi_c^\dagger \phi_c , \tag{1} $$

where $\phi_c$ represents the conformation field. The hermitian conjugate is $\phi^\dagger \equiv (\phi^*)^T$ for a general complex field $\phi$. On the other hand, the nonlinear sources, represented by the field $\phi_s$, are governed by a massless bosonic lagrangian,

$$ \mathcal{L}_s = (\partial_\mu \phi_s)^\dagger (\partial^\mu \phi_s) + V(\phi_s) , \tag{2} $$

with an additional potential $V(\phi_s)$ taking the typical $\phi^4$ self-interaction,

$$ V(\phi_s) = \frac{\lambda}{4} (\phi_s^\dagger \phi_s)^2 , \tag{3} $$

where $\lambda$ is the coupling constant. Both $\phi_c$ and $\phi_s$ are scalar fields. The conformation field $\phi_c = \phi_c(t, x)$ denotes the local curvature of the conformation at position $x$, with $\phi_c(x) \sim 1$ or $0$ for an $\alpha$- or $\beta$-helix.

The choice of interactions in Eqs. (1) and (2) is justified by the following considerations,

• The conformation changes are assumed to be linear. The conformational field is actually not necessarily massive. Although one can put the mass term $m_{\phi_c}$ into the lagrangian by hand as written above, a massive conformational field could also be generated dynamically through a certain symmetry breaking, as shown later.

• The source is assumed to be massless, since it represents the laser or light source injected into the protein chains to induce the folding.

• Its nonlinearity is realized by introducing the $\phi^4$ self-interaction, which leads to a nonlinear EOM.

• For the sake of simplicity, the lagrangian is required to be symmetric under certain transformations; in the present case these are the time and parity symmetries, i.e. $\phi(t, x) \rightarrow -\phi(-t, -x)$ for one-dimensional space.

Two remarks are in order. First, although the model is written in a relativistic form, after deriving the relevant EOMs one can take their non-relativistic limits to obtain the final EOMs describing the desired dynamics. Secondly, instead of using the vector electromagnetic field $A_\mu$ to represent the nonlinear sources, such as a laser, it is more convenient to consider the nonlinear source as a bunch of light or laser, such that one can represent it by a 'macroscopic' scalar field $\phi_s$.

Considering the dimensional counting and the invariance under the time-parity symmetry, the most general interaction between the conformation field and the nonlinear sources is,

$$ \mathcal{L}_\mathrm{int} = -\Lambda (\phi_c^\dagger \phi_c)(\phi_s^\dagger \phi_s) , \tag{4} $$

where $\Lambda$ denotes the strength of the interaction. Eqs. (3) and (4) lead to the total potential in the model,

$$ V_\mathrm{tot} = \frac{1}{2} m_{\phi_c}^2 \phi_c^\dagger \phi_c + \frac{\lambda}{4} (\phi_s^\dagger \phi_s)^2 - \Lambda (\phi_c^\dagger \phi_c)(\phi_s^\dagger \phi_s) . \tag{5} $$

Eqs. (1), (2) and (5) provide the underlying interactions in the model.

Requiring the total potential to be at a minimum with respect to the source field, that is

$$ \left. \frac{\partial V_\mathrm{tot}}{\partial \phi_s} \right|_{\langle\phi_c\rangle, \langle\phi_s\rangle} = 0 , \tag{6} $$

at the vacuum expectation values (VEV) of the fields yields the non-trivial solution,

$$ \langle\phi_s\rangle = \sqrt{\frac{2\Lambda}{\lambda}} \, \langle\phi_c\rangle . \tag{7} $$

Imposing a certain local symmetry, namely the phase or U(1) symmetry, on the total lagrangian above, the VEV in Eq. (7) obviously breaks the symmetry. At the same time, the symmetry breaking shifts the mass term of $\phi_c$ as follows,

$$ m_{\phi_c}^2 \rightarrow \bar{m}_{\phi_c}^2 \equiv m_{\phi_c}^2 - \frac{4\Lambda^2}{\lambda} \langle\phi_c\rangle^2 , \tag{8} $$

from Eq. (5).

On the other hand, Eq. (7) induces the 'tension force', which plays an important role in enabling the folded pathway in the present model. This will be discussed in the following section.
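
As a quick numerical cross-check of Eqs. (6)-(8), the sketch below evaluates the total potential for real field values. It assumes the forms $V_\mathrm{tot} = \frac{1}{2} m_{\phi_c}^2 \phi_c^2 + \frac{\lambda}{4}\phi_s^4 - \Lambda \phi_c^2 \phi_s^2$, $\langle\phi_s\rangle = \sqrt{2\Lambda/\lambda}\,\langle\phi_c\rangle$ and $\bar{m}_{\phi_c}^2 = m_{\phi_c}^2 - (4\Lambda^2/\lambda)\langle\phi_c\rangle^2$. The function names and all parameter values are illustrative assumptions and are not taken from the model.

```python
import math


def v_tot(phi_c, phi_s, m2, lam, Lam):
    """Assumed total potential for real fields: mass + quartic + coupling."""
    return 0.5 * m2 * phi_c**2 + 0.25 * lam * phi_s**4 - Lam * phi_c**2 * phi_s**2


def vev_source(phi_c_vev, lam, Lam):
    """Non-trivial stationary point in phi_s (requires Lam / lam > 0)."""
    return math.sqrt(2.0 * Lam / lam) * phi_c_vev


def shifted_mass_sq(phi_c_vev, m2, lam, Lam):
    """Mass term of phi_c after the source field is set to its VEV."""
    return m2 - 4.0 * Lam**2 / lam * phi_c_vev**2


# Illustrative (hypothetical) parameter values in arbitrary units.
m2, lam, Lam, phi_c_vev = 1.0, 0.5, 0.2, 1.0
phi_s_vev = vev_source(phi_c_vev, lam, Lam)

# Central-difference slope of V_tot along phi_s; it should vanish at the VEV.
h = 1e-6
slope = (v_tot(phi_c_vev, phi_s_vev + h, m2, lam, Lam)
         - v_tot(phi_c_vev, phi_s_vev - h, m2, lam, Lam)) / (2.0 * h)

# Coefficient of phi_c^2 / 2 when phi_s is frozen at its VEV.
m2_from_coupling = m2 - 2.0 * Lam * phi_s_vev**2

print(phi_s_vev, slope, shifted_mass_sq(phi_c_vev, m2, lam, Lam), m2_from_coupling)
```

The last two printed numbers agree, which is the content of the mass shift: freezing $\phi_s$ at its VEV turns the coupling term into an extra contribution to the $\phi_c$ mass term.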



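3. EOMs and their behaviours

Having the total lagrangian at hand, one can derive the EOMs using the Euler-Lagrange equation,

$$ \partial_\mu \frac{\partial \mathcal{L}_\mathrm{tot}}{\partial(\partial_\mu \phi)} - \frac{\partial \mathcal{L}_\mathrm{tot}}{\partial \phi} = 0 , \tag{9} $$

where $\mathcal{L}_\mathrm{tot} = \mathcal{L}_c + \mathcal{L}_s + \mathcal{L}_\mathrm{int}$.

Substituting Eqs. (1), (2) and (4) into Eq. (9) in terms of $\phi_c$ and $\phi_s$, one immediately obtains a set of EOMs,

$$ \left( \frac{\partial^2}{\partial x^2} - \frac{1}{c^2}\frac{\partial^2}{\partial t^2} + \frac{m_{\phi_c}^2 c^2}{\hbar^2} + 2\Lambda\phi_s^2 \right) \phi_c = 0 , \tag{10} $$

$$ \left( \frac{\partial^2}{\partial x^2} - \frac{1}{c^2}\frac{\partial^2}{\partial t^2} + 2\Lambda\phi_c^2 - 3\lambda\phi_s^2 \right) \phi_s = 0 . \tag{11} $$

Here the factors of the light velocity $c$ and $\hbar$, which are set to unity in natural units, have been restored in the equations.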

The last term in Eq. (11) determines the nonlinearity of the EOM of the source. One should also pay attention to the last term of Eq. (10), i.e. $\sim k\,\phi_c$ with $k \sim 2\Lambda\langle\phi_s\rangle^2$. This term induces the tension force in the dynamics of the conformational field, enabling the folded pathway as expected.

Hence, solving both EOMs in Eqs. (10) and (11) simultaneously provides the contour of conformational changes in terms of time and the one-dimensional space component.
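
To illustrate why the tension term depends on position, the following sketch evaluates $k(x) = 2\Lambda\phi_s(x)^2$ for a localized source. The sech-shaped profile, the value of $\Lambda$ and the function names are assumptions chosen only for illustration.

```python
import math


def source_profile(x, center=0.0):
    """Hypothetical localized source: 2 sech(2 (x - center))."""
    return 2.0 / math.cosh(2.0 * (x - center))


def tension(phi_s, Lam):
    """Effective tension coefficient k = 2 * Lam * phi_s**2."""
    return 2.0 * Lam * phi_s**2


Lam = 0.1  # hypothetical coupling strength
xs = [-2.0 + 0.5 * i for i in range(9)]  # sample points from -2 to 2
ks = [tension(source_profile(x), Lam) for x in xs]

for x, k in zip(xs, ks):
    print(f"x = {x:+.1f}  k = {k:.4f}")
```

The coefficient peaks where the source is concentrated and is negligible far from it, so the tension felt by the conformational field follows the source as it moves along the backbone.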



4. Numerical analysis 

Since the EOMs under consideration involve nonlinear terms, one should solve them numerically. The numerical analysis and simulation in the present paper are done using the finite difference method. Throughout the numerical work, the non-relativistic limit $v = dx/dt \ll c$ and the following boundary conditions for both fields are deployed,

$$ \begin{array}{lll}
\phi_s(0,t) = \phi_s(L,t) = 0 & \mathrm{and} \;\; \phi_c(0,t) = \phi_c(L,t) = 0 & \mathrm{for} \;\; 0 \le t \le b , \\
\phi_s(x,0) = f(x) & \mathrm{and} \;\; \phi_c(x,0) = p(x) & \mathrm{for} \;\; 0 \le x \le L , \\
\dfrac{\partial \phi_s(x,0)}{\partial t} = g(x) & \mathrm{and} \;\; \dfrac{\partial \phi_c(x,0)}{\partial t} = q(x) & \mathrm{for} \;\; 0 < x < L ,
\end{array} \tag{12} $$

where $f(x)$, $p(x)$, $g(x)$ and $q(x)$ are newly introduced auxiliary functions. In the finite difference scheme, it is more convenient to replace $\phi_s$ and $\phi_c$ with $u$ and $w$ respectively, and to rewrite them in discrete forms. Let us then consider the coordinate space $R = \{(x,t) : 0 \le x \le L, 0 \le t \le b\}$ discretized on a grid consisting of $(N-1) \times (M-1)$ rectangles with side lengths $\Delta x = \delta$ and $\Delta t = \epsilon$, as shown in Fig. 1. Solving the equations over the grid gives the desired numerical solutions.

Fig. 1. The discretized grid for solving the EOMs over the coordinate space R.
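
The grid and the first row of values can be set up as in the minimal sketch below. The sizes, the centring of the source on the backbone and the helper names `make_grid` and `initial_row` are hypothetical. Indices are 0-based here, whereas the text counts rows and columns from 1.

```python
import math


def make_grid(L, b, N, M):
    """Return node positions, time levels and the step sizes delta, eps."""
    delta = L / (N - 1)
    eps = b / (M - 1)
    xs = [i * delta for i in range(N)]
    ts = [j * eps for j in range(M)]
    return xs, ts, delta, eps


def initial_row(xs, profile):
    """Sample an initial profile and pin both ends to zero."""
    row = [profile(x) for x in xs]
    row[0] = 0.0
    row[-1] = 0.0
    return row


# Hypothetical sizes: backbone length 10, total time 2, 101 x 41 nodes.
L, b, N, M = 10.0, 2.0, 101, 41
xs, ts, delta, eps = make_grid(L, b, N, M)

# Source centred on the backbone (an assumption); conformation field set to 1.
u0 = initial_row(xs, lambda x: 2.0 / math.cosh(2.0 * (x - L / 2.0)))
w0 = initial_row(xs, lambda x: 1.0)
print(len(xs), len(ts), delta, eps)
```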

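Both coupled EOMs in Eqs. (10) and (11) are rewritten in explicit discrete forms as follows,

$$ u_{i,j+1} = 2u_{i,j} - u_{i,j-1} + c^2\epsilon^2 \left( \frac{u_{i+1,j} - 2u_{i,j} + u_{i-1,j}}{\delta^2} + 2\Lambda w_{i,j}^2 u_{i,j} - 3\lambda u_{i,j}^3 \right) , \tag{13} $$

$$ w_{i,j+1} = 2w_{i,j} - w_{i,j-1} + c^2\epsilon^2 \left( \frac{w_{i+1,j} - 2w_{i,j} + w_{i-1,j}}{\delta^2} + \frac{m_{\phi_c}^2 c^2}{\hbar^2} w_{i,j} + 2\Lambda u_{i,j}^2 w_{i,j} \right) , \tag{14} $$

for $i = 2, 3, \cdots, N-1$ and $j = 2, 3, \cdots, M-1$. In order to calculate all values from Eqs. (13) and (14), the initial values for the two lowest rows in Fig. 1 must be given. The values at $t_1$ are fixed by the boundary conditions in Eq. (12).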
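The second-order Taylor expansion can also be used to determine the values in the second row. Therefore, the values at $t_2$ are determined by,

$$ u_{i,2} = f_i + \epsilon g_i + \frac{c^2\epsilon^2}{2} \left( \frac{f_{i+1} - 2f_i + f_{i-1}}{\delta^2} + 2\Lambda p_i^2 f_i - 3\lambda f_i^3 \right) , \tag{15} $$

$$ w_{i,2} = p_i + \epsilon q_i + \frac{c^2\epsilon^2}{2} \left( \frac{p_{i+1} - 2p_i + p_{i-1}}{\delta^2} + \frac{m_{\phi_c}^2 c^2}{\hbar^2} p_i + 2\Lambda f_i^2 p_i \right) , \tag{16} $$

with $f_i = f(x_i)$, $g_i = g(x_i)$, $p_i = p(x_i)$ and $q_i = q(x_i)$, for $i = 2, 3, \cdots, N-1$.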

For the initial stage, suppose the nonlinear source has the particular form $f(x) = 2\,\mathrm{sech}(2x)\, e^{i2x}$ and $p(x) = 1$ to generate the $\alpha$-helix, while $g(x) = q(x) = 0$ for the sake of simplicity. One can then obtain the initial values in this case using Eqs. (15) and (16). The subsequent values are generated by substituting the preceding values into Eqs. (13) and (14). The higher order values can be obtained by iterating this procedure.
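
The iterative procedure can be sketched as follows. This is a minimal illustration under stated assumptions, not the code used for the results. It treats both fields as real, so the source is started from the real envelope $2\,\mathrm{sech}(2(x - x_0))$ instead of the complex profile. It also assumes the nonlinear terms $2\Lambda w^2 u - 3\lambda u^3$ for the source and $(m_{\phi_c}^2 c^2/\hbar^2)\, w + 2\Lambda u^2 w$ for the conformational field. All names and parameter values are hypothetical, and indices are 0-based.

```python
import math


def laplacian(row, i, delta):
    """Second central difference in x at node i."""
    return (row[i + 1] - 2.0 * row[i] + row[i - 1]) / delta**2


def rhs_u(u, w, i, delta, c, lam, Lam):
    """c^2 * (u_xx + 2 Lam w^2 u - 3 lam u^3): assumed source dynamics."""
    return c**2 * (laplacian(u, i, delta)
                   + 2.0 * Lam * w[i]**2 * u[i] - 3.0 * lam * u[i]**3)


def rhs_w(u, w, i, delta, c, Lam, mass):
    """c^2 * (w_xx + mass w + 2 Lam u^2 w): assumed conformation dynamics."""
    return c**2 * (laplacian(w, i, delta)
                   + mass * w[i] + 2.0 * Lam * u[i]**2 * w[i])


def solve(f, p, g, q, delta, eps, rows, c=1.0, lam=0.0, Lam=0.0, mass=0.0):
    """Return `rows` time levels of (u, w); both ends stay fixed at zero."""
    n = len(f)
    u_old, w_old = list(f), list(p)
    u, w = [0.0] * n, [0.0] * n
    # Second row from a second-order Taylor expansion in time.
    for i in range(1, n - 1):
        u[i] = f[i] + eps * g[i] + 0.5 * eps**2 * rhs_u(f, p, i, delta, c, lam, Lam)
        w[i] = p[i] + eps * q[i] + 0.5 * eps**2 * rhs_w(f, p, i, delta, c, Lam, mass)
    levels = [(u_old, w_old), (u, w)]
    # Remaining rows from the explicit three-level update.
    for _ in range(2, rows):
        u_new, w_new = [0.0] * n, [0.0] * n
        for i in range(1, n - 1):
            u_new[i] = 2.0 * u[i] - u_old[i] + eps**2 * rhs_u(u, w, i, delta, c, lam, Lam)
            w_new[i] = 2.0 * w[i] - w_old[i] + eps**2 * rhs_w(u, w, i, delta, c, Lam, mass)
        u_old, w_old, u, w = u, w, u_new, w_new
        levels.append((u, w))
    return levels


# Hypothetical run: real sech envelope for the source, w = 1 in the interior.
N, delta, eps = 101, 0.1, 0.05
xs = [i * delta for i in range(N)]
f = [2.0 / math.cosh(2.0 * (x - 5.0)) for x in xs]
p = [1.0] * N
f[0] = f[-1] = p[0] = p[-1] = 0.0
zeros = [0.0] * N
levels = solve(f, p, zeros, zeros, delta, eps, rows=21, lam=0.1, Lam=0.05)
u_last, w_last = levels[-1]
print(max(abs(v) for v in u_last), max(abs(v) for v in w_last))
```

Keeping only the two most recent rows in memory is enough for the update, since each new row depends on the two rows below it; the full list `levels` is stored here only so that the evolution can be inspected afterwards.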

The result is given in Fig. 2. The left figure in each box describes the propagation of the nonlinear sources along the protein backbone, while the right one shows how the protein is folded. As can be seen in the figure, the protein backbone is initially linear before the nonlinear source is injected. As the soliton starts propagating over the backbone, the conformational changes appear. It should be remarked that the result is obtained up to second-order accuracy in the Taylor expansion. In order to guarantee that the numerical solutions do not contain a large amount of truncation error, the step sizes $\delta$ and $\epsilon$ are kept small enough. Nevertheless, this should be a good approximation for describing visually the mechanism of protein folding.
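
One common way to judge whether $\delta$ and $\epsilon$ are small enough for an explicit scheme of this type is the Courant number $r = c\epsilon/\delta$. For the linear wave equation the central-difference update is stable for $r \le 1$; for the coupled nonlinear system this is only a guideline, not a guarantee. The helper names and the safety factor below are assumptions for illustration.

```python
def courant_number(c, eps, delta):
    """Ratio r = c * eps / delta of the explicit central-difference scheme."""
    return c * eps / delta


def suggest_time_step(c, delta, safety=0.9):
    """Time step that keeps r at `safety` (< 1) for a given spatial step."""
    return safety * delta / c


c, delta, eps = 1.0, 0.1, 0.05  # hypothetical values in arbitrary units
r = courant_number(c, eps, delta)
print(f"r = {r:.2f}, within the linear stability bound: {r <= 1.0}")
print(f"suggested eps for delta = {delta}: {suggest_time_step(c, delta):.3f}")
```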

5. Conclusion 

An extension of a phenomenological model describing the conformational dynamics of proteins is proposed. The model is based on the matter interactions among the relevant constituents, namely the conformational field and the nonlinear sources, represented by the bosonic fields $\phi_c$ and $\phi_s$. It has been shown that, from the relativistic bosonic lagrangian with a $\phi^4$ self-interaction, the nonlinear and tension force terms appear naturally, as expected in some previous works.




Fig. 2. The soliton propagation and conformational changes on the protein backbone inducing protein folding. The vertical axis in the soliton evolution denotes time in seconds, while the horizontal axis denotes its amplitude. The conformational changes are shown in the $(x, y, z)$ coordinates.



However, the present model has a different contour, since the EOMs governing the whole dynamics are the linear and nonlinear Klein-Gordon equations. Note that the original model by Berloff deployed the linear Klein-Gordon and nonlinear Schrödinger equations.

Moreover, the present model has an inhomogeneous tension force, in contrast with the homogeneous tension force in Berloff's model, due to the simultaneous solution of Eqs. (10) and (11). This leads to wriggling folded pathways, as shown in Fig. 2, which should be more natural than the homogeneous ones.